Abstract

Let $A \in \Omega_n$ be a doubly-stochastic $n \times n$ matrix. Alexander Schrijver proved in
1998 the following remarkable inequality:

\[ \mathrm{per}(\widetilde{A}) \ge \prod_{1 \le i,j \le n} (1 - A(i,j)); \quad \widetilde{A}(i,j) =: A(i,j)(1 - A(i,j)),\ 1 \le i,j \le n. \tag{1} \]

We prove in this paper the following generalization (or just clever reformulation)
of (1):

For all pairs of $n \times n$ matrices $(P,Q)$, where $P$ is nonnegative and $Q$ is
doubly-stochastic,

\[ \log(\mathrm{per}(P)) \ge \sum_{1 \le i,j \le n} \log(1 - Q(i,j))(1 - Q(i,j)) - \sum_{1 \le i,j \le n} Q(i,j) \log\left(\frac{Q(i,j)}{P(i,j)}\right). \tag{2} \]

The main corollary of (2) is the following inequality for doubly-stochastic matrices:

\[ \frac{\mathrm{per}(A)}{F(A)} \ge 1; \quad F(A) =: \prod_{1 \le i,j \le n} (1 - A(i,j))^{1 - A(i,j)}. \]

We use this inequality to prove Friedland's conjecture on monomer-dimer entropy, the
so-called Asymptotic Lower Matching Conjecture.
We present explicit doubly-stochastic $n \times n$ matrices $A$ with the ratio
$\frac{\mathrm{per}(A)}{F(A)} = (\sqrt{2})^n$ and conjecture that

\[ \max_{A \in \Omega_n} \frac{\mathrm{per}(A)}{F(A)} \approx (\sqrt{2})^n. \]

If true, it would imply a deterministic poly-time algorithm to approximate the
permanent of $n \times n$ nonnegative matrices within the relative factor $(\sqrt{2})^n$.

*gurvits@lanl.gov. Los Alamos National Laboratory, Los Alamos, NM.
1 The permanent

Recall that an $n \times n$ matrix $A$ is called doubly stochastic if it is nonnegative entry-wise
and every column and every row sums to one. The set of $n \times n$ doubly stochastic matrices
is denoted by $\Omega_n$. The set of $n \times n$ row stochastic matrices (i.e. every row sums to one)
is denoted by $RS_n$; the set of column stochastic matrices (i.e. every column sums to one) is
denoted by $CS_n$.

Let $\Lambda(k,n)$ denote the set of $n \times n$ matrices with nonnegative integer entries and
row and column sums all equal to $k$. We define the following subset of rational doubly
stochastic matrices: $\Omega_{k,n} = \{k^{-1}A : A \in \Lambda(k,n)\}$.

Recall that the permanent of a square matrix $A$ is defined by

\[ \mathrm{per}(A) = \sum_{\sigma \in S_n} \prod_{i=1}^{n} A(i,\sigma(i)). \]

The following inequality was conjectured by B.L. van der Waerden in 1926 and proved
independently in 1981 by D.L. Falikman [15] and G.P. Egorychev [14]:

\[ \min_{A \in \Omega_n} \mathrm{per}(A) = \frac{n!}{n^n} =: vdw(n). \tag{3} \]
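The definition of the permanent and the constant $vdw(n)$ can be illustrated with a minimal sketch. The helper names `per` and `vdw` are illustrative only (they are not from the paper), and the brute-force sum over all $n!$ permutations is meant for very small $n$.

```python
from itertools import permutations
from math import factorial, prod

def per(a):
    """Permanent by direct summation over all n! permutations (small n only)."""
    n = len(a)
    return sum(prod(a[i][s[i]] for i in range(n)) for s in permutations(range(n)))

def vdw(n):
    """Van der Waerden constant n!/n^n from (3)."""
    return factorial(n) / n**n

n = 4
uniform = [[1.0 / n] * n for _ in range(n)]   # J_n / n, which attains the minimum in (3)
identity = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
print(per(uniform), vdw(n), per(identity))
```

The uniform matrix attains the bound (3), while the identity matrix sits at the other extreme with permanent one.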

1.1 Schrijver-Valiant Conjecture and (main) Schrijver's permanental inequality

Define

\[ \lambda(k,n) = \min\{\mathrm{per}(A) : A \in \Omega_{k,n}\} = k^{-n} \min\{\mathrm{per}(A) : A \in \Lambda(k,n)\}; \quad \theta(k) = \lim_{n \to \infty} (\lambda(k,n))^{1/n}. \]

It was proved in [2] (also earlier in [1]) that, using our notations, $\theta(k) \le G(k) =: \left(\frac{k-1}{k}\right)^{k-1}$,
and conjectured that $\theta(k) = G(k)$. Though the case of $k = 3$ was proved by
M. Voorhoeve in 1979 [23], this conjecture was settled only in 1998 [3] (17 years after
the published proof of the Van der Waerden Conjecture). The main result of [3] (as
many people, including myself, wrongly thought) is the remarkable (Schrijver-bound):

\[ \min\{\mathrm{per}(A) : A \in \Omega_{k,n}\} \ge \left(\frac{k-1}{k}\right)^{(k-1)n}. \tag{4} \]

The bound (4) is a corollary of another inequality for doubly-stochastic matrices:

\[ \mathrm{per}(\widetilde{A}) \ge \prod_{1 \le i,j \le n} (1 - A(i,j)); \quad A \in \Omega_n;\ \widetilde{A}(i,j) =: A(i,j)(1 - A(i,j)),\ 1 \le i,j \le n. \tag{5} \]
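As a small numerical illustration of Schrijver's inequality (5), the sketch below evaluates both sides for one doubly-stochastic circulant matrix. The helper names `per` and `circulant` and the chosen entries are assumptions made for the example; this is a sanity check on a single matrix, not a proof.

```python
from itertools import permutations
from math import prod

def per(a):
    """Permanent by direct summation over permutations (small n only)."""
    n = len(a)
    return sum(prod(a[i][s[i]] for i in range(n)) for s in permutations(range(n)))

def circulant(row):
    """Circulant matrix with the given first row; doubly stochastic if the row sums to 1."""
    n = len(row)
    return [[row[(j - i) % n] for j in range(n)] for i in range(n)]

A = circulant([0.5, 0.3, 0.2])
A_tilde = [[x * (1 - x) for x in r] for r in A]   # entries A(i,j)(1 - A(i,j))
lhs = per(A_tilde)                                # left-hand side of (5)
rhs = prod(1 - x for r in A for x in r)           # right-hand side of (5)
print(lhs, rhs)
```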
The proof of (5) in [3] is, in the words of its author, "highly complicated". Surprisingly,
the only application of (5) known to me is the bound (4), which applies only to "very"
rational doubly-stochastic matrices. The main goal of this paper is to show the amazing
power of (5), which has been overlooked for 13 years.

2 A Generalization of Schrijver's permanental inequality

We prove in this section the following theorem, stated in [12] in a rather cryptic way.
Fortunately, the paper cites [13] and M. Chertkov is my colleague in Los Alamos.
The statement in the current paper has been communicated to me by Misha Chertkov,
to whom I am profoundly grateful.

Definition 2.1: Define for a pair $(P,Q)$ of non-negative matrices the following functional:

\[ CW(P,Q) =: \sum_{1 \le i,j \le n} \log(1 - Q(i,j))(1 - Q(i,j)) - \sum_{1 \le i,j \le n} Q(i,j) \log\left(\frac{Q(i,j)}{P(i,j)}\right). \tag{6} \]

(Note that for fixed $P$ the functional $CW(P,Q) = \sum_{1 \le i,j \le n} F_{i,j}(Q(i,j))$ and $F_{i,j}(0) = 0$.)
If $Q \in \Omega_n$ is doubly-stochastic and $P = Diag(a_1, ..., a_n)\, T\, Diag(b_1, ..., b_n)$ then

\[ CW(P,Q) = \sum_{1 \le i \le n} \log(a_i b_i) + CW(T,Q). \tag{7} \]

Therefore, WLOG we can consider only doubly-stochastic matrices $P$.

The functional $CW(P,Q)$ is concave in $P$ and, rather surprisingly (see the 2011 arxiv
version of [12]), concave in $Q \in \Omega_n$.
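The functional (6) and the scaling identity (7) can be checked numerically. The following is a minimal sketch: the function name `cw`, the test matrices and the scaling vectors are illustrative choices, and the code assumes $P(i,j) > 0$ wherever $Q(i,j) > 0$.

```python
from math import log

def cw(P, Q):
    """CW(P,Q) from (6), with the convention 0*log(0) = 0."""
    total = 0.0
    for p_row, q_row in zip(P, Q):
        for p, q in zip(p_row, q_row):
            rest = max(0.0, 1.0 - q)          # guards against rounding slightly above 1
            if rest > 0:
                total += rest * log(rest)     # (1 - q) log(1 - q)
            if q > 0:
                total -= q * log(q / p)       # q log(q / p)
    return total

T = [[2.0, 1.0, 0.5], [1.0, 3.0, 1.0], [0.5, 1.0, 4.0]]
Q = [[0.5, 0.3, 0.2], [0.2, 0.5, 0.3], [0.3, 0.2, 0.5]]   # doubly stochastic
a, b = [2.0, 0.5, 3.0], [1.5, 4.0, 0.25]
P = [[a[i] * T[i][j] * b[j] for j in range(3)] for i in range(3)]  # Diag(a) T Diag(b)
shift = sum(log(a[i] * b[i]) for i in range(3))
print(cw(P, Q), shift + cw(T, Q))
```

The two printed numbers agree up to rounding, because the row and column sums of $Q$ are all one.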

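Theorem 2.2: Let $P$ be a non-negative $n \times n$ matrix. If $Per(P) > 0$ then $\max_{Q \in \Omega_n} CW(P,Q)$
is attained and

\[ \log(Per(P)) \ge \max_{Q \in \Omega_n} CW(P,Q). \tag{8} \]

(It is assumed that $0^0 = 1$.)

An equivalent statement of this theorem is

\[ \log(Per(P)) \ge \sum_{1 \le i,j \le n} \log(1 - Q(i,j))(1 - Q(i,j)) - \sum_{1 \le i,j \le n} Q(i,j) \log\left(\frac{Q(i,j)}{P(i,j)}\right); \quad P \ge 0,\ Q \in \Omega_n. \tag{9} \]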
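Inequality (9) can be probed numerically on random instances. The sketch below is only a sanity check under stated assumptions: doubly-stochastic matrices $Q$ are sampled as random convex combinations of permutation matrices, $P$ is sampled with positive entries, and the helper names (`per`, `cw`, `random_doubly_stochastic`) are illustrative rather than taken from the paper.

```python
import random
from itertools import permutations
from math import log, prod

def per(a):
    n = len(a)
    return sum(prod(a[i][s[i]] for i in range(n)) for s in permutations(range(n)))

def cw(P, Q):
    """CW(P,Q) with the convention 0*log(0) = 0."""
    total = 0.0
    for p_row, q_row in zip(P, Q):
        for p, q in zip(p_row, q_row):
            rest = max(0.0, 1.0 - q)
            if rest > 0:
                total += rest * log(rest)
            if q > 0:
                total -= q * log(q / p)
    return total

def random_doubly_stochastic(n, terms, rng):
    """Random convex combination of `terms` permutation matrices."""
    weights = [rng.random() for _ in range(terms)]
    s = sum(weights)
    Q = [[0.0] * n for _ in range(n)]
    for w in weights:
        sigma = list(range(n))
        rng.shuffle(sigma)
        for i in range(n):
            Q[i][sigma[i]] += w / s
    return Q

rng = random.Random(0)
n = 4
gaps = []
for _ in range(200):
    P = [[rng.uniform(0.1, 2.0) for _ in range(n)] for _ in range(n)]
    Q = random_doubly_stochastic(n, 5, rng)
    gaps.append(log(per(P)) - cw(P, Q))     # should be nonnegative by (9)
print(min(gaps))
```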
Proof: We will prove, to avoid trivial technicalities, just the positive case, i.e. when
$P(i,j) > 0$, $1 \le i,j \le n$.

We first compute the partial derivatives:

\[ \frac{\partial}{\partial Q(i,j)} CW(P,Q) = -2 - \log(1 - Q(i,j)) - \log(Q(i,j)) + \log(P(i,j)), \quad 1 \le i,j \le n. \tag{10} \]



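In the positive case, i.e. for the fixed positive $P$, the functional $CW(P,Q)$ is bounded and
continuous on $\Omega_n$. Therefore the maximum exists. Let $V \in \Omega_n$ be one of the argmaxima,
i.e.

\[ CW(P,V) = \max_{Q \in \Omega_n} CW(P,Q). \]

Then, after some column/row permutations,

\[ V = \begin{pmatrix} V_{1,1} & 0 & \dots & 0 \\ 0 & V_{2,2} & \dots & 0 \\ \dots & \dots & \dots & \dots \\ 0 & 0 & \dots & V_{k,k} \end{pmatrix}; \quad P = \begin{pmatrix} P_{1,1} & P_{1,2} & \dots & P_{1,k} \\ \dots & \dots & \dots & \dots \\ P_{k,1} & P_{k,2} & \dots & P_{k,k} \end{pmatrix}. \]

The diagonal blocks $V_{i,i}$ are indecomposable doubly-stochastic $d_i \times d_i$ matrices;
$\sum_{1 \le i \le k} d_i = n$ and $1 \le k \le n$. Clearly,

\[ CW(P,V) = \sum_{1 \le i \le k} CW(P_{i,i}, V_{i,i}). \]

As $\log(per(P)) \ge \sum_{1 \le i \le k} \log(Per(P_{i,i}))$ it is sufficient to prove that

\[ \log(Per(P_{i,i})) \ge CW(P_{i,i}, V_{i,i}); \quad 1 \le i \le k. \]

For blocks of size one, with the single entry $a$, the inequality is trivial: $(1 - 1)\log(1 - 1) - 1 \cdot \log(\frac{1}{a}) = \log(a)$.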
Consider an (indecomposable) block $V_{i,i}$ of size $d_i \ge 2$ and define its support

\[ Supp(V_{i,i}) = \{(k,l) : V_{i,i}(k,l) > 0\}. \]

Note that $1 > V_{i,i}(k,l) > 0$, $(k,l) \in Supp(V_{i,i})$. Consider the following functional

\[ L(W_{i,i}) =: \sum_{(k,l) \in Supp(V_{i,i})} \log(1 - W_{i,i}(k,l))(1 - W_{i,i}(k,l)) - \sum_{(k,l) \in Supp(V_{i,i})} W_{i,i}(k,l) \log\left(\frac{W_{i,i}(k,l)}{P_{i,i}(k,l)}\right), \]

defined on the compact convex subset of doubly-stochastic matrices which are zero outside
of $Supp(V_{i,i})$. We conclude that the functional $L(\cdot)$ is differentiable at $V_{i,i}$. Note that
$L(V_{i,i}) = CW(P_{i,i}, V_{i,i})$.

We can now express the local extremality condition not on the full $\Omega_{d_i}$ but rather on its
compact convex subset of doubly-stochastic matrices which are zero outside of $Supp(V_{i,i})$.
Using (10) and standard Lagrange multipliers with respect to the variables $V_{i,i}(k,l)$, $(k,l) \in Supp(V_{i,i})$,
we get that there exist real numbers $(\alpha_k; \beta_l)$ such that

\[ -2 - \log(1 - V_{i,i}(k,l)) - \log(V_{i,i}(k,l)) + \log(P_{i,i}(k,l)) = \alpha_k + \beta_l; \quad (k,l) \in Supp(V_{i,i}). \]

This gives, for some positive numbers $a_k, b_l$, the following scaling:

\[ P_{i,i}(k,l) = a_k b_l V_{i,i}(k,l)(1 - V_{i,i}(k,l)); \quad (k,l) \in Supp(V_{i,i}). \tag{11} \]

It follows from the definition of the support that

1.
\[ P_{i,i} \ge Diag(a_k)\, \widetilde{V}_{i,i}\, Diag(b_l); \quad \widetilde{V}_{i,i}(k,l) = V_{i,i}(k,l)(1 - V_{i,i}(k,l)). \tag{12} \]
2. Using the scalability property (7), we get that
\[ CW(P_{i,i}, V_{i,i}) = \sum_{k} \log(a_k) + \sum_{l} \log(b_l) + \sum_{(k,l) \in Supp(V_{i,i})} \log(1 - V_{i,i}(k,l)). \tag{13} \]

Finally it follows from (13) and Schrijver's permanental inequality (5) that

\[ \log(per(Diag(a_k)\, \widetilde{V}_{i,i}\, Diag(b_l))) \ge CW(P_{i,i}, V_{i,i}), \]

and that

\[ \log(per(P_{i,i})) \ge \log(per(Diag(a_k)\, \widetilde{V}_{i,i}\, Diag(b_l))) \ge CW(P_{i,i}, V_{i,i}). \]

Remark 2.3: Note that the proof does not use concavity of $CW(P,V)$ in $V \in \Omega_n$.

3 Corollaries 

1. Schrijver's permanental inequality (5) is a particular case of (9). Indeed

\[ CW(\widetilde{V}, V) = \sum_{1 \le i,j \le n} \log(1 - V(i,j)); \quad V \in \Omega_n. \]
2. Let $P \in \Omega_n$ be a doubly-stochastic $n \times n$ matrix. Then

\[ \log(per(P)) \ge CW(P,P) = \sum_{1 \le i,j \le n} \log(1 - P(i,j))(1 - P(i,j)). \]

We get the following important inequality, perhaps the main observation
in this paper:

\[ \frac{per(P)}{F(P)} \ge 1; \quad F(P) =: \prod_{1 \le i,j \le n} (1 - P(i,j))^{1 - P(i,j)},\ P \in \Omega_n. \tag{14} \]

The lower bound (14) suggests the importance of the following quantity:

\[ UB(n) =: \max_{P \in \Omega_n} \frac{per(P)}{F(P)}. \]

It is easy to show that the limit

\[ UB =: \lim_{n \to \infty} (UB(n))^{1/n} \]

exists and $1 \le UB \le e$. There is an obvious deterministic poly-time algorithm to
approximate the permanent of nonnegative matrices within relative factor $UB(n)$.
The current best rate is $e^n$. Therefore proving that $UB < e$ is of major algorithmic
importance.

Remark 3.1: All previous lower bounds on the permanent of doubly-stochastic
matrices $P \in \Omega_n$ depend only on the dimension $n$ and the support of $P$, i.e. the
previous bounds are structural. The beauty (and potential power) of our lower
bound (14) is in its explicit dependence on the entries of $P$. We use (14) in Section
6 to settle an important conjecture on the monomer-dimer entropy.

Example 3.2: I. Let $P = aJ_n + bI_n$, $a = \frac{1}{2(n-1)}$, $b = \frac{n-2}{2(n-1)}$, i.e. the diagonal
$P(i,i) = \frac{1}{2}$, $1 \le i \le n$, and the off-diagonal entries are equal to $\frac{1}{2(n-1)}$.
It is easy to see that for these $(a,b)$:

\[ 2^{-n+1} \le per(aJ_n + bI_n) = n!\, a^n \sum_{0 \le k \le n} \frac{1}{k!}\left(\frac{b}{a}\right)^k \le n!\, a^n \exp\left(\frac{b}{a}\right). \]



Non-difficult calculations show that for this $P \in \Omega_n$

\[ \frac{per(P)}{F(P)} \approx \left(\sqrt{\frac{e}{2}}\right)^n. \tag{15} \]



II. Let $P = \frac{1}{2}J_2 \in \Omega_2$ be the $2 \times 2$ "uniform" doubly-stochastic matrix. Direct
inspection gives that

\[ CW(P,Q) = -2\log(2) = \log(F(P)), \quad Q \in \Omega_2. \]

Consider now the direct sum $P_{2n} = \frac{1}{2}J_2 \oplus ... \oplus \frac{1}{2}J_2 \in \Omega_{2n}$. Then

\[ \max_{Q \in \Omega_{2n}} CW(P_{2n}, Q) = \log(F(P_{2n})) = -2n\log(2). \tag{16} \]

Therefore in this case

\[ \frac{per(P_{2n})}{F(P_{2n})} = 2^n. \tag{17} \]

This gives the following lower bound on $UB(k)$ for even $k$:

\[ UB(k) \ge (\sqrt{2})^k. \tag{18} \]

As $\max_{Q \in \Omega_{2n}} CW(P_{2n}, Q) = \log(F(P_{2n}))$, this class of matrices also provides a
counter-example to the non-trivial part of Conjecture 15 in [12].
Is the bound (18) sharp?
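The ratio (17) for direct sums of $\frac{1}{2}J_2$ is easy to reproduce. The sketch below uses illustrative helper names (`per`, `F`, `direct_sum_half_J2`) that are not part of the paper, and brute-force permanents, so it is meant only for a few small blocks.

```python
from itertools import permutations
from math import prod, sqrt

def per(a):
    n = len(a)
    return sum(prod(a[i][s[i]] for i in range(n)) for s in permutations(range(n)))

def F(P):
    """F(P) = prod over entries of (1 - P(i,j))^(1 - P(i,j))."""
    return prod((1 - x) ** (1 - x) for row in P for x in row)

def direct_sum_half_J2(blocks):
    """Direct sum of `blocks` copies of (1/2) J_2, a matrix of size 2*blocks."""
    n = 2 * blocks
    return [[0.5 if i // 2 == j // 2 else 0.0 for j in range(n)] for i in range(n)]

for blocks in (1, 2, 3):
    P = direct_sum_half_J2(blocks)
    # ratio per(P)/F(P) next to (sqrt 2)^k with k = 2*blocks
    print(2 * blocks, per(P) / F(P), sqrt(2) ** (2 * blocks))
```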

3. Recall the main function from [8]:

\[ G(x) = \left(\frac{x-1}{x}\right)^{x-1}, \quad x \ge 1. \]

Define, for $P \in \Omega_n$, the column product

\[ CPR_j(P) =: \prod_{1 \le i \le n} (1 - P(i,j))^{1 - P(i,j)}. \tag{19} \]

Define $C_j$ as the number of non-zero entries in the $j$th column; then

\[ CPR_j(P) =: \prod_{1 \le i \le n} (1 - P(i,j))^{1 - P(i,j)} \ge G(C_j). \tag{20} \]

The inequality (14) gives a slightly weaker version of the celebrated
Falikman-Egorychev-van der Waerden lower bound (3):

\[ per(P) \ge \prod_{1 \le j \le n} CPR_j(P) \ge \left(\frac{n-1}{n}\right)^{(n-1)n}. \]

The inequality (20) gives a non-regular real-valued version of (Schrijver-bound):

\[ per(P) \ge \prod_{1 \le j \le n} CPR_j(P) \ge \prod_{1 \le j \le n} G(C_j). \tag{21} \]

In the worst case, the author's bound from [8] is better:

\[ per(P) \ge \prod_{1 \le j \le n} G(\min(j, C_j)). \tag{22} \]

Perhaps, it is true that
Conjecture 3.3:

\[ per(P) \ge \prod_{1 \le j \le n} G(\min(j, EC_j))\ ? \]

where the effective real-valued degree $EC_j = G^{-1}(CPR_j(P))$.
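The column products (19) and the chain of bounds (20)-(21) can be evaluated directly on a small example. This is a minimal sketch with illustrative names (`G`, `cpr`, the sample matrix `P`); it relies on Python's convention `0.0 ** 0 == 1.0`, which matches the paper's $0^0 = 1$.

```python
from itertools import permutations
from math import prod

def per(a):
    n = len(a)
    return sum(prod(a[i][s[i]] for i in range(n)) for s in permutations(range(n)))

def G(x):
    """G(x) = ((x-1)/x)^(x-1) for x >= 1, so that G(1) = 1."""
    return ((x - 1) / x) ** (x - 1)

def cpr(P, j):
    """Column product CPR_j(P) from (19)."""
    return prod((1 - row[j]) ** (1 - row[j]) for row in P)

P = [[0.5, 0.5, 0.0],
     [0.5, 0.25, 0.25],
     [0.0, 0.25, 0.75]]                       # doubly stochastic
n = len(P)
C = [sum(1 for row in P if row[j] > 0) for j in range(n)]   # column supports C_j
cprs = [cpr(P, j) for j in range(n)]
print(per(P), prod(cprs), prod(G(c) for c in C))
```

For this matrix the first column attains equality in (20), since both of its non-zero entries equal $1/2$.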
4 Some historical remarks

The column products $CPR_j(P) =: \prod_{1 \le i \le n} (1 - P(i,j))^{1 - P(i,j)} \ge G(C_j)$ have appeared in
the permanent context before. Let $P = [a|b, ..., |b] \in \Omega_n$ be a doubly-stochastic matrix with
2 distinct columns. Then (Proposition 2.2 in [19])

\[ Per(P) \ge CPR_1(P)\, vdw(n-1). \tag{23} \]

Let us recall a few notations from [8] and [5]:

1. The linear space of homogeneous polynomials with real (complex) coefficients of
degree $n$ and in $m$ variables is denoted $Hom_R(m,n)$ ($Hom_C(m,n)$).

We denote as $Hom_+(m,n)$ ($Hom_{++}(m,n)$) the closed convex cone of polynomials
$p \in Hom_R(m,n)$ with nonnegative (positive) coefficients.

2. For a polynomial $p \in Hom_+(n,n)$ we define its Capacity as

\[ Cap(p) = \inf_{x_i > 0,\ \prod_{1 \le i \le n} x_i = 1} p(x_1, ..., x_n) = \inf_{x_i > 0} \frac{p(x_1, ..., x_n)}{\prod_{1 \le i \le n} x_i}. \tag{24} \]

3. The following product polynomial is associated with an $n \times n$ matrix $P$:

\[ Prod_P(x_1, ..., x_n) =: \prod_{1 \le i \le n} \sum_{1 \le j \le n} P(i,j) x_j. \tag{25} \]

The permanent $per(P)$ is the mixed derivative of the polynomial $Prod_P$:

\[ per(P) = \frac{\partial^n}{\partial x_1 \partial x_2 ... \partial x_n} Prod_P(0). \tag{26} \]

4.
\[ q_{(j)} =: \frac{\partial}{\partial x_j} Prod_P(x_1, ..., x_n) : x_j = 0. \]

Note that the polynomials $q_{(j)} \in Hom_+(n-1, n-1)$.

For example, $q_{(n)} = \frac{\partial}{\partial x_n} Prod_P(x_1, ..., x_{n-1}, 0)$.
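Item 3 above, the permanent as the mixed derivative (26) of the product polynomial (25), can be made concrete by expanding $Prod_P$ symbolically. The sketch below stores a polynomial as a dictionary from exponent tuples to coefficients; the name `product_polynomial` and this representation are assumptions made for the example. Since each variable appears to the first power in $x_1 x_2 \cdots x_n$, the mixed derivative at zero is just the coefficient of that monomial.

```python
from itertools import permutations
from math import prod

def per(a):
    n = len(a)
    return sum(prod(a[i][s[i]] for i in range(n)) for s in permutations(range(n)))

def product_polynomial(P):
    """Coefficients of Prod_P as a dict {exponent tuple: coefficient}."""
    n = len(P)
    poly = {(0,) * n: 1.0}
    for row in P:                         # multiply by the linear form sum_j row[j] * x_j
        nxt = {}
        for expo, coef in poly.items():
            for j, pij in enumerate(row):
                if pij == 0:
                    continue
                e = expo[:j] + (expo[j] + 1,) + expo[j + 1:]
                nxt[e] = nxt.get(e, 0.0) + coef * pij
        poly = nxt
    return poly

P = [[1, 2, 0], [3, 1, 1], [0, 2, 5]]
coeffs = product_polynomial(P)
mixed_derivative = coeffs.get((1, 1, 1), 0.0)   # coefficient of x1*x2*x3
print(mixed_derivative, per(P))
```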



The following lower bound, which holds for all $P \in \Omega_n$, was proved in [5]:

\[ Cap(q_{(j)}) \ge CPR_j(P), \quad 1 \le j \le n. \tag{27} \]

Combining results from [8] (i.e. $Per(P) \ge vdw(n-1)\, Cap(q_{(j)})$, $1 \le j \le n$) and (27)
gives a different version of (14):

\[ per(P) \ge \left(\prod_{1 \le j \le n} CPR_j(P)\right)^{1/n} vdw(n-1), \quad P \in \Omega_n. \tag{28} \]

Or better

\[ per(P) \ge \left(\max_{1 \le j \le n} CPR_j(P)\right) vdw(n-1), \quad P \in \Omega_n. \tag{29} \]

Perhaps, it is even true that
Conjecture 4.1:

\[ per(P) \ge \prod_{1 \le j \le n} Cap(q_{(j)}), \quad P \in \Omega_n\ ? \]

A general, i.e. not doubly-stochastic and not just "permanental", version of
Conjecture 4.1 is the following one:

Conjecture 4.2: Let $p \in Hom_+(n,n)$ be H-Stable, i.e. $p(z_1, ..., z_n) \ne 0$ if the real
parts $RE(z_i) > 0$, $1 \le i \le n$. In other words, the homogeneous polynomial $p$ does not
have roots with positive real parts. Then the following inequality holds

\[ \frac{\partial^n}{\partial x_1 \partial x_2 ... \partial x_n} p(0) \ge Cap(p) \prod_{1 \le j \le n} \frac{Cap(q_{(j)})}{Cap(p)}. \tag{30} \]



5 Some Partial Results Towards the Main Conjecture(s)

Let us formalize the main new question in the following Conjecture.
Conjecture 5.1: Let $P \in \Omega_n$ be a doubly-stochastic matrix. Is it true that

1. "Optimizational" Conjecture

\[ per(P) \le (\sqrt{2})^n \exp\left(\max_{Q \in \Omega_n} CW(P,Q)\right). \]

It will be explained below that the "Optimizational" Conjecture gives a provable
deterministic polynomial (but not strongly polynomial) algorithm to approximate $per(P)$ with
the factor $(\sqrt{2})^n$.

2. Strong Conjecture

\[ per(P) \le (\sqrt{2})^n F(P), \quad F(P) =: \prod_{1 \le i,j \le n} (1 - P(i,j))^{1 - P(i,j)}. \]

The Strong Conjecture obviously gives a deterministic strongly-polynomial algorithm
to approximate $per(P)$ with the factor $(\sqrt{2})^n$.
3. Mild Conjecture

\[ per(P) \le (\sqrt{2})^n (F(P))^c, \]

where $0 < c < 1$ is some universal constant. The case $c = \frac{1}{2}$ seems believable. As
$per(P) \ge F(P)$, the Mild Conjecture gives a deterministic strongly-polynomial
algorithm to approximate $per(P)$ with the factor

\[ (F(P))^{c-1} \le e^{n(1-c)} < e^n. \]

5.1 Some Basic Properties of CW(P, Q) 

The "odd entropy" function $OE(p) = p\log(p) - (1-p)\log(1-p)$, $0 \le p \le 1$, is not convex
on $[0,1]$. Yet, when lifted to the Simplex it becomes convex. This non-obvious result was
proved in the recent extended version of [12]. We present below a simpler and more general
proof.

Definition 5.2: Call a function $f : [0,1] \to R$ simplex-convex if the functional
$f_{Sim(1)}(p_1, ..., p_n) =: f(p_1) + ... + f(p_n)$ is convex on the simplex $Sim_n(1) = \{(p_1, ..., p_n) :
p_i \ge 0, 1 \le i \le n; \sum_{1 \le i \le n} p_i = 1\}$.

Clearly, if $f$ is convex then it is simplex-convex as well. We describe a much wider
class of simplex-convex functions.
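The contrast between convexity on $[0,1]$ and simplex-convexity can be seen numerically for the odd entropy. The sketch below is a randomized midpoint test, not a proof; the names `oe`, `lifted` and `random_simplex_point` and the sampling scheme are illustrative assumptions.

```python
import random
from math import log

def oe(p):
    """Odd entropy p log p - (1-p) log(1-p), with 0 log 0 = 0."""
    a = p * log(p) if p > 0 else 0.0
    b = (1 - p) * log(1 - p) if p < 1 else 0.0
    return a - b

def lifted(ps):
    """The functional f(p_1) + ... + f(p_n) of Definition 5.2 for f = oe."""
    return sum(oe(p) for p in ps)

def random_simplex_point(n, rng):
    w = [rng.expovariate(1.0) for _ in range(n)]
    s = sum(w)
    return [x / s for x in w]

# On [0,1] convexity fails: the midpoint of 0.6 and 1.0 lies above the chord.
violates_on_interval = oe(0.8) > (oe(0.6) + oe(1.0)) / 2

# On the simplex, random midpoint tests show no violation.
rng = random.Random(1)
worst = 0.0
for _ in range(2000):
    n = rng.randint(2, 6)
    x, y = random_simplex_point(n, rng), random_simplex_point(n, rng)
    mid = [(u + v) / 2 for u, v in zip(x, y)]
    worst = max(worst, lifted(mid) - (lifted(x) + lifted(y)) / 2)
print(violates_on_interval, worst)
```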

We need two simple facts. 

Fact 5.3: Let $g : [0,1] \to R$ be a convex function with $g(0) = 0$. Then this function $g$ is
super-additive:

\[ g(t_1 + ... + t_n) \ge g(t_1) + ... + g(t_n) : \quad t_i \ge 0, 1 \le i \le n;\ t_1 + ... + t_n \le 1. \]

Proof: Let $t_i \ge 0$, $1 \le i \le n$; $t_1 + ... + t_n = s \le 1$. Lift $g$ to the simplex $sSim_n(1) =:
Sim_n(s) = \{t_i \ge 0, 1 \le i \le n;\ t_1 + ... + t_n = s\}$:

\[ \bar{g}(t_1, ..., t_n) = g(t_1) + ... + g(t_n). \]

The functional $\bar{g}$ is convex on the simplex $Sim_n(s)$. Therefore, its maximum is attained
at the extreme points, i.e. at the vectors $(s, 0, ..., 0), ..., (0, ..., 0, s)$. As $g(0) = 0$ we get
that

\[ \max_{Sim_n(s)} \bar{g}(t_1, ..., t_n) = g(s), \]

which finishes the proof.
Fact 5.4: Consider the linear subspace $Sum_0 \subset R^{n+1}$, $Sum_0 = \{(x_0, x_1, ..., x_n) :
\sum_{0 \le i \le n} x_i = 0\}$.

Let $a_0 > 0$; $a_i > 0$, $1 \le i \le n$. The diagonal matrix $D = Diag(-a_0, a_1, ..., a_n)$ is positive
semidefinite on $Sum_0$, i.e. $\langle DX, X \rangle \ge 0$, $X \in Sum_0$, iff

\[ \sum_{1 \le i \le n} \frac{1}{a_i} \le \frac{1}{a_0}. \tag{31} \]

Proof: The proof directly follows from the following easily checkable equality:

\[ \min_{\sum_{1 \le i \le n} y_i = 1} \sum_{1 \le i \le n} a_i y_i^2 = \left(\sum_{1 \le i \le n} a_i^{-1}\right)^{-1}. \]



Theorem 5.5: Let $f : [0,1] \to R$ be a continuous function, twice differentiable on $(0,1)$.
Define $g(t) = f(\frac{1}{2} + t)$, $t \in [-\frac{1}{2}, \frac{1}{2}]$. Assume that the second derivative satisfies
the following properties:

1. $g^{(2)}(t) > 0$, $0 > t > -\frac{1}{2}$; $g^{(2)}(0) \ge 0$.

2. $g^{(2)}(t) \ge -g^{(2)}(-t)$, $0 \le t \le \frac{1}{2}$.

3. $\lim_{t \to -\frac{1}{2}} \frac{1}{g^{(2)}(t)} = 0$.

4. The function $\frac{1}{g^{(2)}(t)}$ is convex on $[-\frac{1}{2}, 0)$.

Then the function $f$ is simplex-convex.

Remark 5.6: The "odd entropy" function $OE(p) = p\log(p) - (1-p)\log(1-p)$, $0 \le p \le 1$,
satisfies the above properties. Indeed,

\[ \frac{1}{g^{(2)}(t)} = \frac{1}{OE^{(2)}(\frac{1}{2} + t)} = -\frac{1}{8t} + \frac{t}{2}. \]

Proof: As $f$ is continuous it is sufficient to prove that $f(t_0) + ... + f(t_n)$ is convex in the
interior of the simplex $Sim_{n+1}(1)$, i.e. when $0 < t_i < 1$. Define $d_i =: f^{(2)}(t_i)$. We need to
prove that $D =: Diag(d_0, d_1, ..., d_n)$ is positive semidefinite on $Sum_0$.
If $t_j \le \frac{1}{2}$ for all $j$ then $d_j \ge 0$ and $D$ is positive semidefinite. Otherwise, there is only one $t_i > \frac{1}{2}$,
say

\[ t_0 = \frac{1}{2} + s_0,\ \frac{1}{2} > s_0 > 0; \quad t_i = \frac{1}{2} - s_i;\ \frac{1}{2} > s_i > 0,\ 1 \le i \le n. \]

Note that $d_i > 0$, $1 \le i \le n$. If $f^{(2)}(t_0) = g^{(2)}(s_0) \ge 0$ we are done. Assume that
$-\beta =: f^{(2)}(t_0) = g^{(2)}(s_0) < 0$, $\beta > 0$. Our goal, using Fact 5.4, is to prove that

\[ \beta^{-1} \ge \sum_{1 \le i \le n} d_i^{-1}. \tag{32} \]

Note that

\[ \frac{1}{2} - s_0 = \sum_{1 \le i \le n} \left(\frac{1}{2} - s_i\right). \]

Using the properties (2-4) above and Fact 5.3, applied to the convex function $a(t) =:
\frac{1}{g^{(2)}(t - \frac{1}{2})}$, $t \in [0, \frac{1}{2}]$ (with $a(0) = 0$ by property 3), we get that

\[ \frac{1}{f^{(2)}(\frac{1}{2} - s_0)} \ge \sum_{1 \le i \le n} \frac{1}{f^{(2)}(\frac{1}{2} - s_i)} = \sum_{1 \le i \le n} d_i^{-1}. \]

As $f^{(2)}(\frac{1}{2} + s_0) < 0$ we get from property (2) above that

\[ \beta = -f^{(2)}(\tfrac{1}{2} + s_0) \le f^{(2)}(\tfrac{1}{2} - s_0). \]

This gives the desired inequality (32).

Remark 5.7: Recall the definition of the Bregman Distance associated with a convex
functional $f$:

\[ D_f(X\|Y) = f(X) - f(Y) - \langle \nabla f(Y), X - Y \rangle. \]

For instance, the Kullback-Leibler Divergence is the Bregman Distance associated
with

\[ f(p_1, ..., p_n) = \sum_{1 \le i \le n} p_i \log(p_i). \]

As we know that the "odd entropy" functional

\[ OE(p_1, ..., p_n) = \sum_{1 \le i \le n} p_i \log(p_i) - (1 - p_i)\log(1 - p_i) \]

is convex on the simplex $Sim_n(1)$, we can define a new divergence, which we call Bethe
Divergence:

\[ BD(X\|Y) = \sum_{1 \le i \le n} \left( x_i \log\left(\frac{x_i}{y_i}\right) - (1 - x_i)\log\left(\frac{1 - x_i}{1 - y_i}\right) \right), \quad X, Y \in Sim_n(1). \tag{33} \]

It would be interesting to investigate statistical (or learning) applications of the Bethe
Divergence.
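The Bethe Divergence (33) is the Bregman distance of the odd entropy restricted to the simplex. A minimal sketch, assuming strictly positive probability vectors and using illustrative function names, compares the closed form (33) with the generic Bregman formula:

```python
from math import log

def bethe_divergence(x, y):
    """BD(X||Y) from (33); assumes 0 < x_i, y_i < 1."""
    return sum(xi * log(xi / yi) - (1 - xi) * log((1 - xi) / (1 - yi))
               for xi, yi in zip(x, y))

def odd_entropy(p):
    return sum(pi * log(pi) - (1 - pi) * log(1 - pi) for pi in p)

def bregman_odd_entropy(x, y):
    """f(X) - f(Y) - <grad f(Y), X - Y> for f = odd_entropy."""
    grad = [log(yi) + log(1 - yi) + 2 for yi in y]
    inner = sum(g * (xi - yi) for g, xi, yi in zip(grad, x, y))
    return odd_entropy(x) - odd_entropy(y) - inner

x = [0.7, 0.2, 0.1]
y = [0.2, 0.3, 0.5]
print(bethe_divergence(x, y), bregman_odd_entropy(x, y))
```

The constant 2 in the gradient drops out because the coordinates of $X - Y$ sum to zero on the simplex.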

5.2 Some easy exact computations of $\max_{Q \in \Omega_n} CW(P,Q)$

The following fact is an easy corollary of the simplex-convexity of the "odd entropy"
function $OE(p) = p\log(p) - (1-p)\log(1-p)$, $0 \le p \le 1$.

Fact 5.8:

1. Let $p_i > 0$, $1 \le i \le n$, be a positive vector, $n \ge 2$. Define

\[ OD(q,p) = \sum_{1 \le i \le n} (1 - q_i)\log(1 - q_i) - q_i \log\left(\frac{q_i}{p_i}\right), \quad q \in Sim_n(1). \]

Then

\[ \max_{(q_1, ..., q_n) \in Sim_n(1)} OD(q,p) = \log(p_j) \]

iff $p_j \ge \sum_{i \ne j} p_i$. We call such an index $j$ dominant.

Note that if $n \ge 3$ then there exists at most one dominant index.

If there is no dominant index then the maximum is attained in the interior of the
simplex $Sim_n(1)$.

2. Let $p_i = const > 0$, $1 \le i \le n$. Then

\[ \max_{(q_1, ..., q_n) \in Sim_n(1)} OD(q,p) = OD\left(\left(\tfrac{1}{n}, ..., \tfrac{1}{n}\right), p\right) = (n-1)\log(1 - n^{-1}) + \log(n) + \log(const). \]

Remark 5.9: The first item of Fact 5.8 says that for $n \ge 3$ the extremum of $OD(q,p)$, $p > 0$,
is either an extreme point of the simplex (when the unique dominant index exists) or
a point in the interior. This is in stark contrast with KLD-minimization, where the
extremum has the largest possible support.
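The dichotomy of Fact 5.8 can be observed with a coarse grid search over the simplex for $n = 3$. This is only an illustration: the function names `od` and `grid_max`, the grid resolution and the two test vectors are assumptions of the example.

```python
from math import log

def od(q, p):
    """OD(q,p) of Fact 5.8, with the convention 0*log(0) = 0."""
    total = 0.0
    for qi, pi in zip(q, p):
        if qi < 1:
            total += (1 - qi) * log(1 - qi)
        if qi > 0:
            total -= qi * log(qi / pi)
    return total

def grid_max(p, steps=60):
    """Maximize OD over the grid {(i, j, k)/steps : i + j + k = steps} of Sim_3(1)."""
    best, arg = float("-inf"), None
    for i in range(steps + 1):
        for j in range(steps + 1 - i):
            q = (i / steps, j / steps, (steps - i - j) / steps)
            v = od(q, p)
            if v > best:
                best, arg = v, q
    return best, arg

best_dom, arg_dom = grid_max((5.0, 1.0, 2.0))      # index 0 is dominant: 5 >= 1 + 2
best_flat, arg_flat = grid_max((1.0, 1.0, 1.0))    # no dominant index
print(best_dom, arg_dom)
print(best_flat, arg_flat)
```

With a dominant index the grid maximum sits at the corresponding vertex with value $\log(p_j)$; for the constant vector it sits at the barycenter, in agreement with item 2.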

We will take advantage of the following corollary. 

Corollary 5.10: Let $RS_n$ denote the set of $n \times n$ row-stochastic matrices.
Let $P$ be an $n \times n$ diagonally dominant non-negative matrix, i.e. $P(i,i) \ge \sum_{j \ne i} P(i,j)$; $1 \le i \le n$.
Then

\[ \max_{Q \in RS_n} CW(P,Q) = \sum_{1 \le i \le n} \log(P(i,i)). \tag{34} \]

The following observation follows now from the scalability property (7).

Corollary 5.11: Assume that there exist two positive diagonal matrices $D_1, D_2$ such that the
matrix $B = D_1 P D_2$ is diagonally dominant, i.e. $B(i,i) \ge \sum_{j \ne i} B(i,j)$. Then

\[ \max_{Q \in \Omega_n} CW(P,Q) = \sum_{1 \le i \le n} \log(P(i,i)). \]
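5.3 Regular Bipartite Graphs

Let $RB(r,n)$ denote the set of $n \times n$ boolean matrices with row and column sums all
equal to $r$. Note that if $A \in RB(r,n)$ then $\frac{1}{r}A$ is doubly-stochastic and

\[ F\left(\tfrac{1}{r}A\right) = \left(\frac{r-1}{r}\right)^{(r-1)n} = G(r)^n. \]

The celebrated Bregman's upper bound [16] gives that

\[ per\left(\tfrac{1}{r}A\right) \le \left(\frac{(r!)^{1/r}}{r}\right)^n. \]

Therefore

\[ \frac{per(\frac{1}{r}A)}{F(\frac{1}{r}A)} \le \left(\frac{(r!)^{1/r}}{r\, G(r)}\right)^n \le (\sqrt{2})^n. \]

Therefore, the Strong Conjecture holds on the sets $RB(r,n)$.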
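The claim for $RB(r,n)$ rests on two ingredients: the identity $F(\frac{1}{r}A) = G(r)^n$ stated above and Bregman's bound $per(A) \le (r!)^{n/r}$ for $A \in RB(r,n)$. Taking $n$-th roots, the ratio $per(\frac{1}{r}A)/F(\frac{1}{r}A)$ is at most the $n$-th power of $(r!)^{1/r}/(r\,G(r))$. The sketch below tabulates this per-row factor; the name `bregman_ratio` and the range of $r$ are illustrative choices.

```python
from math import factorial, sqrt

def G(x):
    """G(x) = ((x-1)/x)^(x-1), with G(1) = 1."""
    return ((x - 1) / x) ** (x - 1)

def bregman_ratio(r):
    """(r!)^(1/r) / (r * G(r)): n-th root of the Bregman bound on per(A/r) / F(A/r)."""
    return factorial(r) ** (1.0 / r) / (r * G(r))

ratios = {r: bregman_ratio(r) for r in range(1, 31)}
for r in (1, 2, 3, 6, 30):
    print(r, ratios[r], sqrt(2))
```

In this range the factor never exceeds $\sqrt{2}$, and the value $\sqrt{2}$ is reached at $r = 2$, matching the direct sums of $\frac{1}{2}J_2$ in Example 3.2.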

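Let $CO(\frac{1}{r}RB(r,n))$ be the convex hull. It follows from linearity of the permanent in
individual rows that

\[ per(A) \le \left(\frac{(r!)^{1/r}}{r}\right)^n, \quad A \in CO\left(\tfrac{1}{r}RB(r,n)\right). \]

The following observation (most likely known) follows fairly directly from the classical
J. Edmonds' result that the intersection of two matroid polytopes is the polytope of the
intersection of the corresponding two matroids with the same ground set.

Proposition 5.12: The convex hull

\[ CO\left(\tfrac{1}{r}RB(r,n)\right) = \left\{A \in \Omega_n : A(i,j) \le \tfrac{1}{r};\ 1 \le i,j \le n\right\}. \]

Corollary 5.13:

\[ CO\left(\tfrac{1}{r+1}RB(r+1,n)\right) \subset CO\left(\tfrac{1}{r}RB(r,n)\right), \quad 1 \le r \le n-1. \]

We can only state the (rather trivial) upper bound

\[ \frac{per(A)}{F(A)} \le \left(\frac{e\,(r!)^{1/r}}{r}\right)^n, \quad A \in CO\left(\tfrac{1}{r}RB(r,n)\right). \tag{35} \]

It follows from (35) that the Strong Conjecture holds on $CO(\frac{1}{r}RB(r,n))$, $r \ge 6$.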
5.4 Diagonally Dominant Matrices
Lemma 5.14: Let $A$ be an $n \times n$ non-negative matrix. Then

\[ Per(A) \le \prod_{1 \le i \le n} \left(A(i,i)^2 + \Big(\sum_{j \ne i} A(i,j)\Big)^2\right)^{1/2}. \tag{36} \]

Proof: Follows from linearity of the permanent in individual rows and the following
generalized Holder's inequality

\[ \prod_{1 \le i \le n} x_i + \prod_{1 \le i \le n} y_i \le \prod_{1 \le i \le n} \left(|x_i|^n + |y_i|^n\right)^{1/n}. \tag{37} \]

Corollary 5.15: If $A$ is Diagonally Dominant then the "Optimizational" Conjecture
holds, i.e.

\[ per(A) \le (\sqrt{2})^n \exp\left(\max_{Q \in \Omega_n} CW(A,Q)\right). \]
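Lemma 5.14 is simple to test on random non-negative matrices. The following is a randomized sanity check rather than a proof; the names `per` and `lemma_bound`, the seed and the matrix sizes are illustrative assumptions.

```python
import random
from itertools import permutations
from math import prod, sqrt

def per(a):
    n = len(a)
    return sum(prod(a[i][s[i]] for i in range(n)) for s in permutations(range(n)))

def lemma_bound(A):
    """Right-hand side of (36): prod_i sqrt(A(i,i)^2 + (sum_{j != i} A(i,j))^2)."""
    n = len(A)
    return prod(sqrt(A[i][i] ** 2 + sum(A[i][j] for j in range(n) if j != i) ** 2)
                for i in range(n))

rng = random.Random(2)
ok = True
for _ in range(300):
    n = rng.randint(2, 5)
    A = [[rng.random() for _ in range(n)] for _ in range(n)]
    ok = ok and per(A) <= lemma_bound(A) * (1 + 1e-12)
print(ok)
```

For a diagonally dominant matrix each factor in (36) is at most $\sqrt{2}\,A(i,i)$, which is where the factor $(\sqrt{2})^n$ in Corollary 5.15 comes from.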

6 A proof of Friedland's Asymptotic Lower Matching Conjecture

6.1 Two models for random regular bipartite graphs with multiple edges

We denote as $RI(r,n)$ the set of $n \times n$ non-negative integer matrices with row and column
sums all equal to $r$:

\[ RI(r,n) = \{\{A(i,j); 1 \le i,j \le n\} : A(i,j) \in Z_+;\ Ae = A^T e = re\}. \]

1. The Pairing Model: Consider a random, with respect to the uniform distribution,
permutation $\pi \in S_{rn}$ of length $rn$ and its standard matrix representation, pictured as
a block matrix:

\[ M_\pi = \begin{pmatrix} M_\pi(1,1) & M_\pi(1,2) & \dots & M_\pi(1,r) \\ \dots & \dots & \dots & \dots \\ M_\pi(r,1) & M_\pi(r,2) & \dots & M_\pi(r,r) \end{pmatrix}, \]

where each block is a (boolean) $n \times n$ matrix. The Pairing Model for a random
matrix in $RI(r,n)$ corresponds to the random matrix $BM(r,n) =: \sum_{1 \le i,j \le r} M_\pi(i,j)$.
This model was used in the context of the permanent in [2].

2. The sum of $r$ independent permutation matrices: Another model is just the
sum of $r$ independent permutation matrices:

\[ HW(r,n) =: \sum_{1 \le i \le r} M_{\sigma_i}, \]

where $\sigma_i \in S_n$, $1 \le i \le r$, are independent uniformly distributed permutations of
length $n$. This model was used by Herbert Wilf [1]. As in [2], the main goal and
result of [1] was the asymptotics of the expected value of the permanent:

\[ \lim_{n \to \infty} (E(per(HW(r,n))))^{1/n} = \lim_{n \to \infty} (E(per(BM(r,n))))^{1/n} = rG(r). \tag{38} \]

n— >oo n— >-oo 

It is worth noticing that the proof in [1] is much more involved than in [2]. One of the
corollaries of (38) is the following inequality

\[ \lim_{n \to \infty} \left(\min_{A \in RI(r,n)} per(A)\right)^{1/n} \le rG(r), \tag{39} \]

which was proved much later to be an equality.

Let $prob_1(r,n)$ be the probability of the event $BM(r,n) \in RB(r,n)$, where $RB(r,n)$ is
the set of $n \times n$ boolean matrices with $r$ ones in each row and column; let $prob_2(r,n)$ be the
probability of the event $HW(r,n) \in RB(r,n)$. Brendan McKay conjectured in [26] that
for fixed $r$ (we present here a simplified expression)

\[ prob_1(r,n) = \exp\left(-\frac{(r-1)^2}{2} + O(n^{-1})\right). \tag{40} \]

This conjecture was proved almost 20 years after in [23]; moreover it holds for $r = o(\sqrt{n})$.
The proof in [26] is rather involved and has nothing to do with the permanent.
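On the other hand, it is easy to see that

\[ \frac{1}{(n!)^{r-1}} \prod_{1 \le i \le r-1} \min_{A \in RB(n-i,n)} per(A) \le prob_2(r,n) \le \frac{1}{(n!)^{r-1}} \prod_{1 \le i \le r-1} \max_{A \in RB(n-i,n)} per(A). \tag{41} \]

We can now use various lower bounds on $\min_{A \in RB(n-i,n)} per(A)$ and the Bregman's upper
bound $((n-i)!)^{\frac{n}{n-i}}$ on $\max_{A \in RB(n-i,n)} per(A)$.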

Using just the Van Der Waerden-Falikman-Egorychev (or even Bang-Friedland) bound
we get that

\[ \prod_{1 \le i \le r-1} \left(\frac{n-i}{n}\right)^n \le prob_2(r,n) \le \prod_{1 \le i \le r-1} \frac{((n-i)!)^{\frac{i}{n-i}}}{(n-i+1) \cdots n}. \tag{42} \]

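The best current lower bound (22) gives

\[ \prod_{1 \le i \le r-1} \frac{G(n-i)^i (n-i)^i}{(n-i+1) \cdots n} \le prob_2(r,n) \le \prod_{1 \le i \le r-1} \frac{((n-i)!)^{\frac{i}{n-i}}}{(n-i+1) \cdots n}. \tag{43} \]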
For a fixed $r$, (42) as well as (43) give the following asymptotic for $prob_2(r,n)$:

\[ prob_2(r,n) \approx \exp\left(-\frac{r(r-1)}{2}\right), \]

which is less than (40).
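For very small $n$ the probability $prob_2(r,n)$ can be computed exactly by enumeration and compared with the van der Waerden lower bound $\prod_{1 \le i \le r-1} (1 - i/n)^n$ and the Bregman upper bound $\prod_{1 \le i \le r-1} ((n-i)!)^{n/(n-i)}/n!$ that follow from (41). The sketch below is illustrative: the function names are assumptions, and the first permutation is fixed to the identity, which is harmless by symmetry.

```python
from itertools import permutations, product
from math import factorial, prod

def prob2(r, n):
    """Exact probability that r independent uniform permutations of {0..n-1} are
    pairwise disjoint, i.e. that the sum of their permutation matrices is boolean."""
    perms = list(permutations(range(n)))
    identity = tuple(range(n))
    good = 0
    for rest in product(perms, repeat=r - 1):
        chosen = (identity,) + rest
        # disjointness: at every point i the r images are all different
        if all(len({s[i] for s in chosen}) == r for i in range(n)):
            good += 1
    return good / factorial(n) ** (r - 1)

def lower_vdw(r, n):
    return prod((1 - i / n) ** n for i in range(1, r))

def upper_bregman(r, n):
    return prod(factorial(n - i) ** (n / (n - i)) / factorial(n) for i in range(1, r))

for r, n in ((2, 4), (3, 4), (2, 5)):
    print(r, n, lower_vdw(r, n), prob2(r, n), upper_bregman(r, n))
```

For $r = 2$ the exact value is the fraction of derangements, e.g. $9/24$ for $n = 4$.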

Ian Wanless noticed in [24] that

\[ \min_{A \in RB(r,n)} per(A) \le (prob_1(r,n))^{-1} E(per(BM(r,n))). \]

Together with (40) it implies that

\[ \lim_{n \to \infty} \left(\min_{A \in RB(r,n)} per(A)\right)^{1/n} \le rG(r), \tag{44} \]

which is the main conclusion of [23]. We sketched above an alternative, simpler way
to get the same result by combining Herbert Wilf's 1966 paper and the Van Der Waerden-
Falikman-Egorychev Inequality and their recent refinements.



6.2 Monomer-Dimer Problem 



Let $per_m(A)$ denote the sum of permanents of all $m \times m$ submatrices of $A$:

\[ per_m(A) =: \sum_{|S| = |T| = m} per(A_{S,T}). \]

Define the following two quantities

\[ EMD_1(r,n;m) = E(per_m(BM(r,n))), \quad EMD_2(r,n;m) = E(per_m(HW(r,n))). \]

A rather direct generalization of derivations in [2] and [1] gives the following asymptotics

\[ \lim_{n \to \infty,\ \frac{m}{n} \to t \in [0,1]} \frac{\log(EMD_1(r,n;m))}{n} = g_r(t) =: t\log\left(\frac{r}{t}\right) - 2(1-t)\log(1-t) + (r-t)\log\left(1 - \frac{t}{r}\right), \]

and

\[ \lim_{n \to \infty,\ \frac{m}{n} \to t \in [0,1]} \frac{\log(EMD_2(r,n;m))}{n} = g_r(t). \]

It follows that

\[ \lim_{n \to \infty,\ \frac{m}{n} \to t \in [0,1]} \frac{\min_{A \in RI(r,n)} \log(per_m(A))}{n} \le g_r(t). \]

The Wanless argument gives the same inequality for the boolean case

\[ \lim_{n \to \infty,\ \frac{m}{n} \to t \in [0,1]} \frac{\min_{A \in RB(r,n)} \log(per_m(A))}{n} \le g_r(t). \]

The Friedland's Asymptotic Lower Matching Conjecture asserts (after [2], [3]) that

\[ \lim_{n \to \infty,\ \frac{m}{n} \to t \in [0,1]} \frac{\min_{A \in RB(r,n)} \log(per_m(A))}{n} \ge g_r(t). \]

We prove in this paper a slightly stronger result:

\[ \lim_{n \to \infty,\ \frac{m}{n} \to t \in [0,1]} \frac{\min_{A \in RI(r,n)} \log(per_m(A))}{n} \ge g_r(t). \]

Of course, as we explained above using the Wanless argument, the inequalities ($\ge$) in the
last two displays imply equalities.

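The Lower Matching Conjecture asserts that

\[ per_m(A) \ge D(r;m,n) =: \binom{n}{m}^2 \left(\frac{r-t}{r}\right)^{n(r-t)} (tr)^{nt}; \quad A \in RB(r,n),\ t =: \frac{m}{n}. \tag{49} \]

We prove in this paper the following weaker inequality but for a more general class of
matrices, i.e. for $A \in RI(r,n)$:

\[ per_m(A) \ge SF(r,n,m) =: \frac{\left(\frac{r-t}{r}\right)^{n(r-t)} (1 - n^{-1})^{(1 - n^{-1})2n^2(1-t)}}{\left(\frac{t}{r}\right)^{nt} n^{-2n(1-t)} ((n(1-t))!)^2}. \tag{50} \]

We note that

\[ \frac{D(r;m,n)}{SF(r,n,m)} = \left(\frac{G(m+1) \cdots G(n)}{G(n)^{n-m}}\right)^2 \ge 1, \quad m \le n, \tag{51} \]

where $G(x) = \left(\frac{x-1}{x}\right)^{x-1}$, $x \ge 1$.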



The following simple Fact will be used below.
Fact 6.1:

1. Define the following function $G(x,t) = \left(\frac{x-t}{x}\right)^{x-t}$, $x \ge t \ge 0$. For a fixed $t \ge 0$ the
function $G(x,t)$ is decreasing in $x$.

2. Let $(a_1, ..., a_k)$ be positive numbers, $\sum_{1 \le i \le k} a_i = 1$. Then

\[ \prod_{1 \le i \le k} (1 - ta_i)^{1 - ta_i} \ge G(k,t); \quad 0 \le t \le 1. \]

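Theorem 6.2: Let $A \in RI(r,n)$. For a positive integer $m \le n$ define

\[ t = \frac{m}{n}, \quad \alpha = \frac{t}{r}. \]

Then the following lower bound holds:

\[ per_m(A) \ge SF(r,n,m) = \frac{(1-\alpha)^{(1-\alpha)rn} (1 - \frac{1}{n})^{(1 - \frac{1}{n})2n^2(1-t)}}{\alpha^m n^{-2n(1-t)} ((n(1-t))!)^2}. \tag{52} \]

(Notice that $(1-\alpha)^{(1-\alpha)rn} = G(r,t)^n$.)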

Proof:

1. Step 1.

Consider the following $(2n-m) \times (2n-m)$ matrix

\[ K = \begin{pmatrix} aA & bJ_{n,n-m} \\ (bJ_{n,n-m})^T & 0 \end{pmatrix}, \tag{53} \]

where $a = \alpha = \frac{t}{r}$, $b = \frac{1}{n}$, and $J_{n,n-m}$ is the $n \times (n-m)$ matrix of all ones.
It is easy to check that this matrix $K$ is doubly-stochastic. Importantly, the following
identity holds:

\[ per_m(A) = \frac{per(K)}{a^m b^{2(n-m)} ((n-m)!)^2}. \tag{54} \]

2. Step 2.

We apply the inequality (14) to the doubly-stochastic matrix $K$:

\[ per(K) \ge \left(\prod_{1 \le i,j \le n} \left(1 - \frac{t}{r}A(i,j)\right)^{1 - \frac{t}{r}A(i,j)}\right) \left(1 - \frac{1}{n}\right)^{(1 - \frac{1}{n})2n^2(1-t)}. \tag{55} \]

3. Step 3.

Let $d_j$ be the number of non-zero entries in the $j$th column of $A$. Notice that $d_j \le r$
and $\sum_{A(i,j) \ne 0} \frac{A(i,j)}{r} = 1$. It follows from Fact 6.1 that

\[ \prod_{1 \le i,j \le n} \left(1 - \frac{t}{r}A(i,j)\right)^{1 - \frac{t}{r}A(i,j)} \ge \prod_{1 \le j \le n} G(d_j, t) \ge G(r,t)^n. \tag{56} \]

This gives the following lower bound on the permanent of $K$:

\[ per(K) \ge G(r,t)^n \left(1 - \frac{1}{n}\right)^{(1 - \frac{1}{n})2n^2(1-t)}. \tag{57} \]

4. Step 4.

Finally, we get (52) by combining the (nontrivial, new) inequality (57) with the
(trivial, well known) identity (54).
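The steps above can be traced on a tiny instance. The sketch below builds the matrix $K$ of (53) for a small $A \in RB(2,3)$, checks the identity (54) in the form $per(K) = per_m(A)\, a^m b^{2(n-m)} ((n-m)!)^2$, and compares $per_m(A)$ with $SF(r,n,m)$ from (50). All function names (`per`, `per_m`, `build_K`, `SF`) are illustrative, and the brute-force permanents restrict this to very small sizes.

```python
from itertools import combinations, permutations
from math import factorial, prod

def per(a):
    n = len(a)
    return sum(prod(a[i][s[i]] for i in range(n)) for s in permutations(range(n)))

def per_m(A, m):
    """Sum of permanents of all m x m submatrices of A."""
    n = len(A)
    return sum(per([[A[i][j] for j in T] for i in S])
               for S in combinations(range(n), m)
               for T in combinations(range(n), m))

def build_K(A, r, m):
    """The (2n-m) x (2n-m) matrix K of (53), with a = m/(n r) and b = 1/n."""
    n = len(A)
    a, b = m / (n * r), 1.0 / n
    top = [[a * A[i][j] for j in range(n)] + [b] * (n - m) for i in range(n)]
    bottom = [[b] * n + [0.0] * (n - m) for _ in range(n - m)]
    return top + bottom, a, b

def SF(r, n, m):
    """The lower bound SF(r,n,m) of (50), for n >= 2."""
    t = m / n
    alpha = t / r
    num = (1 - alpha) ** ((1 - alpha) * r * n) * (1 - 1 / n) ** ((1 - 1 / n) * 2 * n * (n - m))
    den = alpha ** m * n ** (-2 * (n - m)) * factorial(n - m) ** 2
    return num / den

A = [[1, 1, 0], [0, 1, 1], [1, 0, 1]]      # an element of RB(2,3)
r, n, m = 2, 3, 2
K, a, b = build_K(A, r, m)
lhs = per(K)
rhs = per_m(A, m) * a ** m * b ** (2 * (n - m)) * factorial(n - m) ** 2
print(lhs, rhs, per_m(A, m), SF(r, n, m))
```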



Remark 6.3: We can express $SF(r,n,m)$ in terms of the function $G$, $G(x) = \left(\frac{x-1}{x}\right)^{x-1}$, $x \ge 1$:

\[ SF(r,n,m) = \frac{(1-\alpha)^{(1-\alpha)rn}\, G(n)^{2n(1-t)}}{\alpha^m (G(1) \cdots G(n(1-t)))^2 (1-t)^{2n(1-t)}}. \tag{58} \]

The following more general result is proved in the very same way.

Theorem 6.4: Let $P \in \Omega_n$. Then

\[ per_m(P) \ge \frac{\left(\prod_{1 \le i,j \le n} (1 - tP(i,j))^{1 - tP(i,j)}\right) G(n)^{2(n-m)}}{t^m n^{-2n(1-t)} ((n(1-t))!)^2}. \tag{59} \]

Using Fact 6.1 one can get various corollaries of Theorem 6.4 expressed in terms of the
support of the doubly-stochastic matrix $P$.
Corollary 6.5: Fix a positive integer $r$ and consider a sequence of pairs $(n,m)$ such that

\[ n \to \infty, \quad \frac{m}{n} \to t \in (0,1). \]

Then

\[ \frac{\log(SF(r,n,m))}{n} \to g_r(t) = t\log\left(\frac{r}{t}\right) - 2(1-t)\log(1-t) + (r-t)\log\left(1 - \frac{t}{r}\right). \tag{60} \]

Together with the upper bounds ($\le$) of Section 6.2 this solves the Asymptotic Lower Matching Conjecture:

\[ \lim_{n \to \infty} \frac{\log(\min_{A \in RB(r,n)} per_m(A))}{n} = \lim_{n \to \infty} \frac{\log(\min_{A \in RI(r,n)} per_m(A))}{n} = g_r(t). \tag{61} \]

Proof: We only need to prove (60). The proof follows either from the Stirling approximation
of the factorial or from the representation (58), using the well known fact that
$\lim_{n \to \infty} G(n) = e^{-1}$.
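The convergence in (60) can be observed numerically by evaluating $\log(SF(r,n,m))$ with `math.lgamma`, which avoids overflowing factorials. This is a sketch under stated assumptions: the formula for $SF$ is the one in (50), $g_r(t)$ is taken as $t\log(r/t) - 2(1-t)\log(1-t) + (r-t)\log(1-t/r)$, and the function names and sample parameters are illustrative.

```python
from math import lgamma, log, log1p

def g(r, t):
    """g_r(t) for 0 < t < 1."""
    return t * log(r / t) - 2 * (1 - t) * log(1 - t) + (r - t) * log(1 - t / r)

def log_SF(r, n, m):
    """log SF(r,n,m) from (50), computed term by term in log scale."""
    t = m / n
    alpha = t / r
    return ((1 - alpha) * r * n * log1p(-alpha)                 # ((r-t)/r)^(n(r-t))
            + (1 - 1 / n) * 2 * n * (n - m) * log1p(-1 / n)     # (1-1/n)^((1-1/n) 2n^2 (1-t))
            - m * log(alpha)                                    # (t/r)^(-nt)
            + 2 * (n - m) * log(n)                              # n^(2n(1-t))
            - 2 * lgamma(n - m + 1))                            # ((n(1-t))!)^(-2)

r, t = 3, 0.4
errors = []
for n in (10**2, 10**4, 10**6):
    m = int(t * n)
    errors.append(abs(log_SF(r, n, m) / n - g(r, m / n)))
print(errors)
```

The gap shrinks roughly like $\log(n)/n$, as expected from the Stirling correction term.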



Remark 6.6:

1. The representation $\frac{n!}{n^n} = \prod_{1 \le i \le n} G(i)$ provides a very simple derivation of the Stirling
formula.

2. The first published statement of the Asymptotic Lower Matching Conjecture appeared
in [11]. The author learned about the statement of (61) from Shmuel Friedland in
2005.

The main result of [7] (and of the 2006 arxiv version) was the limit equality (61) for
$t = \frac{r}{r+s}$, $s = 0, 1, 2, ...$. The fairly self-contained and simple proof in [7] was based
on the "hyperbolic polynomials approach" introduced first in [19]. The actual result
in [7] was stated in terms of sums of mixed derivatives of general positive
hyperbolic polynomials (the same as H-Stable in [8]), albeit for a restricted range
of the parameter $t$. The proof in the present paper is not general at all; it works
only for the m-permanent, i.e. for the class of polynomials $Sym_m(y_1, ..., y_n)$, where
$y_i$ are linear forms with non-negative coefficients. But in this case the full range of
densities $t \in [0,1]$ is covered.

Whether it can be generalized to general H-Stable polynomials remains open.
Our proof of the Asymptotic Lower Matching Conjecture illustrates once more how
badly the "Bethe Restatement" of Schrijver's inequality (5) had been overlooked.
The author did some search on Google Scholar and found, to his amazement, that
the Bethe approximation is one of the oldest heuristics for the monomer-dimer problem,
going back to the 1930s. So, the recent Bethe Approximation approach (a heuristic)
to the permanent is, in a way, a rediscovery. Apparently, the first recent publication
in this direction was [18].

How cool is it that this classical statistical physics stuff was one of the main keys to
rigorously settle the Asymptotic Lower Matching Conjecture! Of course, it would
have been rather useless without the amazing Schrijver's inequality (5). Note that
the validity of Conjecture 4.1 also implies the Asymptotic Lower Matching Conjecture.
It would be great to prove Conjecture 4.1 using H-Stable polynomials.

3. The following equality holds for the doubly-stochastic matrices $K$ as in (53):

\[ \log(F(K)) = \max_{Q \in \Omega_{2n-m}} CW(K,Q). \]



7 A disproof of a positive correlation conjecture due to [Lu,Mohr,Szekely]

Let $A$ be an $n \times n$ stochastic matrix, i.e. the rows of $A$ are probability distributions on
$\{1, ..., n\}$; $(e_1, ..., e_n)$ is the standard basis in $R^n$.
Let $V =: (V_1, ..., V_n)$ be an $n$-tuple of independent random vectors:

\[ Prob(V_i = e_k) = A(i,k); \quad 1 \le i,k \le n. \]

The distribution of the sum $V_1 + ... + V_n$ coincides with the vector of the coefficients of
the product polynomial $Prod_A$,

\[ Prod_A(x_1, ..., x_n) = \prod_{1 \le i \le n} \sum_{1 \le j \le n} A(i,j) x_j, \]

i.e. the probability $Prob(V_1 + ... + V_n = (\omega_1, ..., \omega_n))$ is the coefficient $a_{\omega_1, ..., \omega_n}$ of the
monomial $\prod_{1 \le i \le n} x_i^{\omega_i}$ in the polynomial $Prod_A$. In particular,

\[ per(A) = Prob(V_1 + ... + V_n = e), \tag{62} \]

where $e = (1, 1, ..., 1)$ is the vector of all ones.

Notice that the expected value $E(V_1 + ... + V_n) = (c_1, ..., c_n)$, where $c_j$ is the sum of the
$j$th column of $A$. Thus in the doubly-stochastic case

\[ per(A) = Prob(V_1 + ... + V_n = E(V_1 + ... + V_n)) = Prob(\|V_1 + ... + V_n - E(V_1 + ... + V_n)\|_2^2 < 2), \tag{63} \]

and the lower bounds on the permanent of doubly-stochastic matrices can be viewed as 
concentration inequalities for sums of independent random vectors. This interpretation 
raises a number of natural questions: 

1. What are the lower bounds on $\mathrm{Prob}(\|V_1 + \dots + V_n - E(V_1 + \dots + V_n)\|_2^2 < R)$, 
$R \le n(n-1)$, in the doubly-stochastic case? Van Der Waerden-Falikman-Egorychev 
gives the lower bound $\frac{n!}{n^n} \approx \exp(-n)$ for $R \le 2$. 

This question, albeit for distributions associated with H-Stable polynomials, was 
asked by the author in [10]. 

2. Is it possible to use this probabilistic interpretation to get new lower bounds, like 
([rjj in this paper? 

3. Is there a lower bound, similar to Van Der Waerden-Falikman-Egorychev, on 
$\mathrm{Prob}(\|V_1 + \dots + V_n - E(V_1 + \dots + V_n)\|_2^2 < 2)$ for stochastic matrices, perhaps with some different, 
yet small, radius? 

4. The coefficients of the product polynomials $\mathrm{Prod}_A$, $A \ge 0$, and of more general 
H-Stable and Strongly Log-Concave polynomials [11], satisfy a lot of log-concave 
like inequalities. Perhaps this can be used to prove new concentration inequalities of 
the type we listed above? 

5. We invite the reader to raise more questions. 
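For very small $n$, Question 1 can be explored numerically by exact enumeration. The sketch below is only an illustration under stated assumptions: the function name `ball_probability` and the choice of the uniform matrix $A = J_3/3$ are hypothetical, and the cost is $n^n$, so it is not a practical method.

```python
from fractions import Fraction
from itertools import product
from math import factorial


def ball_probability(A, R):
    """Exact Prob(||S - E(S)||_2^2 < R) for S = V_1 + ... + V_n."""
    n = len(A)
    # E(S) is the vector of column sums of A
    mean = [sum(A[i][j] for i in range(n)) for j in range(n)]
    total = Fraction(0)
    for choice in product(range(n), repeat=n):  # choice[i] = k means V_i = e_k
        prob = Fraction(1)
        counts = [0] * n
        for i, k in enumerate(choice):
            prob *= A[i][k]
            counts[k] += 1
        dist2 = sum((counts[j] - mean[j]) ** 2 for j in range(n))
        if dist2 < R:
            total += prob
    return total


n = 3
A = [[Fraction(1, n)] * n for _ in range(n)]  # uniform doubly-stochastic matrix
for R in (2, 3, n * (n - 1) + 1):
    print(R, ball_probability(A, R))
```

For this uniform matrix the value at $R = 2$ is the permanent $n!/n^n$, i.e. the Van Der Waerden bound is attained, and the probability reaches one once $R$ exceeds $n(n-1)$.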

Remark 7.1: We presented above a very simple and effective "classical" generator to 
sample the distribution $\mathrm{Dist} = \{a_{\omega_1, \dots, \omega_n} : (\omega_1, \dots, \omega_n) \in Z_+^n, \omega_1 + \dots + \omega_n = n\}$, where 
$a_{\omega_1, \dots, \omega_n}$ are the coefficients of the product polynomial $\mathrm{Prod}_A$ and $A$ is a stochastic matrix. 
The similar problem for the doubly-stochastic polynomial 

$$\mathrm{Per}_U(x_1, \dots, x_n) = \mathrm{per}(U \mathrm{Diag}(x_1, \dots, x_n) U^*),$$

where $U$ is an $n \times n$ complex unitary matrix, is of major importance in Quantum Computing. 
The generator in this paper can be viewed as a classical approximation: essentially, 
we approximate $\mathrm{Per}_U$ by the lower bound $\mathrm{Prod}_B$, where $B(i,j) = |U(i,j)|^2$; $1 \le i, j \le n$. 
If $p \in \mathrm{Hom}_+(n,n)$ is doubly-stochastic and log-concave on $R_+^n$ then its coefficients satisfy 
the (sharp) inequality 

pwx,... )W „ < n 

Ki<n 
The permanental polynomials $\mathrm{Per}_U(x_1, \dots, x_n)$ have much weaker, yet sharp, upper bounds: 

<r TT ^ 

yui,...,c; n 2: 11 a;, ■ 
l<i<n M i 

Notice that the permanental polynomial $\mathrm{Per}_U$ is doubly-stochastic, thus 

$$\mathrm{Per}_U(x_1, \dots, x_n) \ge \prod_{1 \le i \le n} x_i; \quad x_j > 0,\ 1 \le j \le n,$$

in other words $\mathrm{per}(Q) \ge \det(Q)$ for PSD matrices $Q \succeq 0$. It is, of course, a well known 
result due to I. Schur. But our proof is much simpler and shorter than all previous ones. 
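As a quick sanity check of the inequality $\mathrm{per}(Q) \ge \det(Q)$, here is a minimal sketch on one real symmetric PSD matrix. It illustrates the statement only, not the proof; the helper names `gram` and `per_and_det` and the integer matrix `B` are assumptions made for this example, and the real symmetric case is just a special case of the Hermitian one.

```python
from itertools import permutations


def sign(sigma):
    """Sign of a permutation, computed from its number of inversions."""
    n = len(sigma)
    inversions = sum(
        1 for i in range(n) for j in range(i + 1, n) if sigma[i] > sigma[j]
    )
    return -1 if inversions % 2 else 1


def per_and_det(Q):
    """Permanent and determinant from the same permutation expansion."""
    n = len(Q)
    per = det = 0
    for sigma in permutations(range(n)):
        term = 1
        for i in range(n):
            term *= Q[i][sigma[i]]
        per += term
        det += sign(sigma) * term
    return per, det


def gram(B):
    """Q = B B^T, which is positive semidefinite by construction."""
    n, k = len(B), len(B[0])
    return [[sum(B[i][c] * B[j][c] for c in range(k)) for j in range(n)]
            for i in range(n)]


B = [[1, 2, 0], [0, 1, -1], [3, 0, 1]]  # hypothetical integer factor
Q = gram(B)
per_Q, det_Q = per_and_det(Q)
print(per_Q, det_Q)
```

Integer entries keep the arithmetic exact, so the comparison is free of rounding issues.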



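Define the following $n$ events: 

$$NE_i = \{(V_1, \dots, V_n) : V_i \notin \{V_j, j \ne i\}\}; \quad 1 \le i \le n.$$

Equivalently 

$$\mathrm{per}(A) = \mathrm{Prob}(\cap_{1 \le i \le n} NE_i). \tag{64}$$

The authors of [21] noticed that $\mathrm{Prob}(NE_i) = \sum_{1 \le j \le n} A(i,j) \prod_{k \ne i} (1 - A(k,j))$ and 
conjectured the following beautiful positive correlation inequality for doubly-stochastic matrices 
$A \in \Omega_n$: 

$$\mathrm{per}(A) \ge G(A) =: \prod_{1 \le i \le n} \mathrm{Prob}(NE_i) = \prod_{1 \le i \le n} \sum_{1 \le j \le n} A(i,j) \prod_{k \ne i} (1 - A(k,j)). \tag{65}$$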

It is easy to see that $G(A) \ge F(A)$, $A \in \Omega_n$, and $G(A) = F(A)$ in the regular case, i.e. 
when $A \in r^{-1} RB(r,n)$; $1 \le r \le n$. Therefore in this regular case the inequality (65) 
holds and is equivalent to the (Schrijver-bound) (J3J). 
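The regular case can be illustrated on a small example. The sketch below is an assumption-laden illustration rather than anything from the paper: it takes $A = \frac{1}{2}(I + P)$ for the $4 \times 4$ cyclic shift $P$ (so $r = 2$), uses the definition of $G(A)$ from (65) and the formula $F(A) = \prod_{i,j} (1 - A(i,j))^{1 - A(i,j)}$ from the abstract, and the function names are hypothetical.

```python
from fractions import Fraction
from itertools import permutations
import math


def permanent(A):
    """Permanent by direct expansion over all permutations (tiny n only)."""
    n = len(A)
    total = Fraction(0)
    for sigma in permutations(range(n)):
        term = Fraction(1)
        for i in range(n):
            term *= A[i][sigma[i]]
        total += term
    return total


def G(A):
    """G(A) = prod_i sum_j A(i,j) prod_{k != i} (1 - A(k,j)), computed exactly."""
    n = len(A)
    result = Fraction(1)
    for i in range(n):
        row_total = Fraction(0)
        for j in range(n):
            others = Fraction(1)
            for k in range(n):
                if k != i:
                    others *= 1 - A[k][j]
            row_total += A[i][j] * others
        result *= row_total
    return result


def F(A):
    """F(A) = prod_{i,j} (1 - A(i,j))^(1 - A(i,j)); floats, exponents are fractional."""
    return math.prod((1 - float(a)) ** (1 - float(a)) for row in A for a in row)


n, r = 4, 2
# A = (1/r) * (0/1 circulant with r ones per row and column)
A = [[Fraction(1, r) if (j - i) % n < r else Fraction(0) for j in range(n)]
     for i in range(n)]
print(permanent(A), G(A), F(A))
```

Here $G(A)$ and $F(A)$ coincide, both being equal to $(1 - \frac{1}{r})^{(r-1)n}$, and the permanent is larger.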

Actually, in this regular case a stronger correlational inequality follows from (22): 
ProKrWnJVfl) > G(1 ^^ (r) II ProbiEV,). 

^V) l<i<n 

Apparently the authors of [21] did a substantial numerical validation of the conjecture 
on random matrices of modest size. 

Surprisingly, the Monomer-Dimer Problem provides a probabilistic counter-example. 
We will present finite families $F_n \subset \Omega_n$ such that $G(A) = Const$, $A \in F_n$, but the average, 
with some weights, of the permanent over $F_n$ is exponentially smaller than $Const$. 



Remark 7.2: 






1. One can ask for a Sidak-like [22] correlational inequality: 

$$\mathrm{per}(A) = \mathrm{Prob}\left(\cap_{1 \le j \le n} \{|(V_1 + \dots + V_n)_j - 1| < 1\}\right) \ge \prod_{1 \le j \le n} \mathrm{Prob}(\{|(V_1 + \dots + V_n)_j - 1| < 1\}), \quad A \in \Omega_n? \tag{66}$$

It is easy to see that 

$$\mathrm{Prob}(\{|(V_1 + \dots + V_n)_j - 1| < 1\}) = \sum_{1 \le i \le n} A(i,j) \prod_{m \ne i} (1 - A(m,j)).$$

In the notations of Conjecture (4.1): 

$$\mathrm{Prob}(\{|(V_1 + \dots + V_n)_j - 1| < 1\}) = q_{(j)}(1, \dots, 1).$$

So, the conjectured correlation inequality (66) can be rewritten as 

$$\mathrm{per}(A) \ge SD(A) =: \prod_{1 \le j \le n} \left( \sum_{1 \le i \le n} A(i,j) \prod_{m \ne i} (1 - A(m,j)) \right) = \prod_{1 \le j \le n} q_{(j)}(1, \dots, 1), \quad A \in \Omega_n?$$

Notice that Conjecture (4.1) claims a smaller lower bound: instead of $q_{(j)}(1, \dots, 1)$, 
it uses 

$$\mathrm{Cap}(q_{(j)}) =: \inf_{x_i > 0,\ \prod_{i \ne j} x_i = 1} q_{(j)}(x_1, \dots, x_{j-1}, x_{j+1}, \dots, x_n).$$

Similarly to the [Lu,Mohr,Szekely] conjecture, (66) holds in the regular case but fails 
in general: 

$SD(A) \ge F(A)$, $A \in \Omega_n$, and $SD(A) = G(A) = F(A)$ in the regular case. 

2. The Sidak Lemma for gaussian vectors [22] plays a crucial role in Barvinok's recent 
bound [6] on the number of perfect matchings in general regular graphs 
without small cuts. 

3. Is there a direct, i.e. probabilistic, way to prove the correlational inequalities (65), (66) in 
the regular case? What makes regular bipartite graphs so "correlationally" special? 
One possible answer is the following observation. 

Proposition 7.3: Let an H-Stable polynomial $p \in \mathrm{Hom}_+(n,n)$ be $r$-regular, i.e. 

$$p(e + (t - 1) e_i) = \left( \frac{r + t - 1}{r} \right)^r. \tag{67}$$



(Notice that $r$-regular polynomials are doubly-stochastic, therefore $\mathrm{Cap}(p) = 1$.) 
Recall the definition of polynomials $q_{(j)} \in \mathrm{Hom}_+(n-1, n-1)$: 

$$q_{(j)}(x_k : k \ne j) =: \frac{\partial}{\partial x_j} p(x_1, \dots, x_n) \Big|_{x_j = 0}.$$

Then the polynomials $(G(r))^{-1} q_{(j)}$ are doubly-stochastic, $1 \le j \le n$. 

Proof: It is easy to see that a polynomial $q \in \mathrm{Hom}_+(n-1, n-1)$ is doubly-stochastic 
iff $\mathrm{Cap}(q) \ge 1$ and $q(e) = 1$. 
It follows from (67) that $(G(r))^{-1} q_{(j)}(e) = 1$, $1 \le j \le n$. 
Finally, Theorem 4.10 in [5] gives the lower bound $\mathrm{Cap}((G(r))^{-1} q_{(j)}) \ge 1$. ∎ 

Corollary 7.4: Conjecture holds for r-regular H-Stable polynomials. 



7.1 The Construction 

Let us first sketch our strategy: we know that for doubly-stochastic matrices $A \in \Omega_n$ 

$$G(A) \ge F(A); \quad SD(A) \ge F(A); \quad \mathrm{per}(A) \ge F(A).$$

We are after matrices where $F(A)$ is asymptotically close to the permanent, but both $G(A)$ 
and $SD(A)$ are much greater than $F(A)$. 

It is conceivable that there is an explicit counter-example even for $n = 3$. But this paper 
proves only the existence of such matrices by the probabilistic method for large enough 
$n$. We actually don't need the nontrivial inequality $\mathrm{per}(A) \ge F(A)$, $A \in \Omega_n$, but it has 
motivated the use of the monomer-dimer problem below. 

Consider either of two random models in $RI(r,n)$, say a random matrix $BM(r,n) \in RI(r,n)$. 
It induces a conditional distribution on $RB(r,n)$, i.e. a random matrix 
$CBM(r,n) \in RB(r,n)$ with the distribution 

$$\mathrm{Prob}(CBM(r,n) = A \in RB(r,n)) = \frac{\mathrm{Prob}(BM(r,n) = A \in RB(r,n))}{\mathrm{Prob}(BM(r,n) \in RB(r,n))}.$$

The Wanless argument gives that 

Hm \og(E(per m (CBM(r,n)))) < 
n->oo,f^te[o,i] n ~ 

Let $K \in \Omega_{2n-m}$ be the following random doubly-stochastic matrix 

$$K = \begin{pmatrix} a\, CBM(r,n) & b J_{n,n-m} \\ (b J_{n,n-m})^T & 0 \end{pmatrix}, \tag{68}$$

$a = \frac{t}{r}$, $t = \frac{m}{n}$; $b = \frac{1}{n}$. By direct inspection, we get that 



$$G(K) = \left( t \left(1 - \frac{t}{r}\right)^{r-1} \left(1 - \frac{1}{n}\right)^{n(1-t)} + (1 - t) \left(1 - \frac{1}{n}\right)^{n-1} \right)^{n} \left( \left(1 - \frac{1}{n}\right)^{n(1-t)-1} \left(1 - \frac{t}{r}\right)^{r} \right)^{n(1-t)}$$

and 

$$F(K) = \left(1 - \frac{t}{r}\right)^{(r-t)n} \left(1 - \frac{1}{n}\right)^{(n-1)2n(1-t)}.$$
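The closed forms for $G(K)$ and $F(K)$ can be cross-checked against the definitions on a concrete instance. The sketch below makes several assumptions that are not in the paper: a fixed circulant 0/1 matrix stands in for the random matrix $CBM(r,n)$ (both formulas depend only on $r$, $n$, $m$), the lower-right block of $K$ is taken to be zero, and all function names are hypothetical.

```python
from fractions import Fraction
import math


def circulant_regular(n, r):
    """0/1 matrix with r ones in every row and column (stand-in for CBM(r, n))."""
    return [[1 if (j - i) % n < r else 0 for j in range(n)] for i in range(n)]


def build_K(B, r, m):
    """K = [[a*B, b*J], [b*J^T, 0]] with a = t/r, t = m/n, b = 1/n."""
    n = len(B)
    a, b = Fraction(m, n) / r, Fraction(1, n)
    size = 2 * n - m
    K = [[Fraction(0)] * size for _ in range(size)]
    for i in range(n):
        for j in range(n):
            K[i][j] = a * B[i][j]
        for j in range(n, size):
            K[i][j] = b  # upper-right block
            K[j][i] = b  # lower-left block
    return K


def G(A):
    """G(A) = prod_i sum_j A(i,j) prod_{k != i} (1 - A(k,j)), computed exactly."""
    n = len(A)
    result = Fraction(1)
    for i in range(n):
        row_total = Fraction(0)
        for j in range(n):
            others = Fraction(1)
            for k in range(n):
                if k != i:
                    others *= 1 - A[k][j]
            row_total += A[i][j] * others
        result *= row_total
    return result


def F(A):
    """F(A) = prod_{i,j} (1 - A(i,j))^(1 - A(i,j)), in floating point."""
    return math.prod((1 - float(x)) ** (1 - float(x)) for row in A for x in row)


def G_closed(n, r, m):
    """Closed form of G(K); note n(1 - t) = n - m."""
    t, c = Fraction(m, n), 1 - Fraction(1, n)
    top = t * (1 - t / r) ** (r - 1) * c ** (n - m) + (1 - t) * c ** (n - 1)
    bottom = c ** (n - m - 1) * (1 - t / r) ** r
    return top ** n * bottom ** (n - m)


def F_closed(n, r, m):
    """Closed form of F(K)."""
    t = m / n
    return (1 - t / r) ** ((r - t) * n) * (1 - 1 / n) ** ((n - 1) * 2 * n * (1 - t))


n, r, m = 4, 2, 2
K = build_K(circulant_regular(n, r), r, m)
print(G(K) == G_closed(n, r, m), F(K), F_closed(n, r, m))
```

Exact fractions are used for $G$ so that the comparison is an equality; $F$ involves fractional exponents and is compared in floating point.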



Recall that 

$$\mathrm{per}_m(CBM(r,n)) = \frac{\mathrm{per}(K)}{a^m b^{2(n-m)} ((n-m)!)^2}. \tag{69}$$
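The relation between $\mathrm{per}(K)$ and the number of $m$-matchings $\mathrm{per}_m$ can be verified by brute force on a tiny instance. This is a minimal sketch under assumptions: $\mathrm{per}_m(B)$ is taken to be the sum of the permanents of all $m \times m$ submatrices of $B$, a fixed circulant matrix replaces $CBM(r,n)$, the lower-right block of $K$ is zero, and the helper names are hypothetical.

```python
from fractions import Fraction
from itertools import combinations, permutations
from math import factorial


def permanent(A):
    """Permanent by direct expansion; the empty matrix has permanent 1."""
    n = len(A)
    total = 0
    for sigma in permutations(range(n)):
        term = 1
        for i in range(n):
            term *= A[i][sigma[i]]
        total += term
    return total


def per_m(B, m):
    """Sum of permanents of all m x m submatrices (number of m-matchings for 0/1 B)."""
    n = len(B)
    total = 0
    for rows in combinations(range(n), m):
        for cols in combinations(range(n), m):
            total += permanent([[B[i][j] for j in cols] for i in rows])
    return total


n, r, m = 3, 2, 2
B = [[1 if (j - i) % n < r else 0 for j in range(n)] for i in range(n)]
a, b = Fraction(m, n) / r, Fraction(1, n)

# K = [[a*B, b*J], [b*J^T, 0]] of size 2n - m
size = 2 * n - m
K = [[Fraction(0)] * size for _ in range(size)]
for i in range(n):
    for j in range(n):
        K[i][j] = a * B[i][j]
    for j in range(n, size):
        K[i][j] = K[j][i] = b

lhs = permanent(K)
rhs = per_m(B, m) * a ** m * b ** (2 * (n - m)) * factorial(n - m) ** 2
print(lhs, rhs)
```

The factor $((n-m)!)^2$ counts the ways to route the $n - m$ unmatched rows and the $n - m$ unmatched columns through the two constant blocks.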



The conjecture (65) would imply, if true, that 

$$\mathrm{per}_m(CBM(r,n))\, a^m b^{2(n-m)} ((n-m)!)^2 \ge G(K). \tag{70}$$

Which would give 

$$f(r,n,m) =: E\left( \mathrm{per}_m(CBM(r,n))\, a^m b^{2(n-m)} ((n-m)!)^2 \right) \ge G(K). \tag{71}$$

But 

$$\limsup_{n \to \infty,\ \frac{m}{n} \to t \in [0,1]} \frac{\log(f(r,n,m))}{n} = (r - t) \log\left(1 - \frac{t}{r}\right) - 2(1 - t) =: M(t).$$

And 

$$\lim_{n \to \infty,\ \frac{m}{n} \to t \in [0,1]} \frac{\log(G(K))}{n} = \log\left( t \left(1 - \frac{t}{r}\right)^{r-1} e^{-(1-t)} + (1 - t) e^{-1} \right) - (1 - t)^2 + r(1 - t) \log\left(1 - \frac{t}{r}\right) =: S(t).$$

The final observation is the following strict inequality 

$$S(t) > M(t), \quad 0 < t < 1,$$

which follows from the strict concavity of the logarithm and the inequality 

$$\left(1 - \frac{t}{r}\right)^{r-1} e^{-(1-t)} > e^{-1}, \quad 0 < t < 1.$$
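A numerical look at the gap $S(t) - M(t)$ may help build intuition. The sketch below is an illustration, not a proof: it assumes $M(t) = (r - t)\log(1 - \frac{t}{r}) - 2(1 - t)$ and $S(t) = \log\big(t(1 - \frac{t}{r})^{r-1} e^{-(1-t)} + (1 - t)e^{-1}\big) - (1 - t)^2 + r(1 - t)\log(1 - \frac{t}{r})$, evaluates both on a grid inside $(0, 1)$, and uses hypothetical function names.

```python
import math


def M(t, r):
    """Limit of log(f(r, n, m)) / n as m/n -> t."""
    return (r - t) * math.log(1 - t / r) - 2 * (1 - t)


def S(t, r):
    """Limit of log(G(K)) / n as m/n -> t."""
    mix = t * (1 - t / r) ** (r - 1) * math.exp(-(1 - t)) + (1 - t) * math.exp(-1)
    return math.log(mix) - (1 - t) ** 2 + r * (1 - t) * math.log(1 - t / r)


def min_gap(r, steps=99):
    """Smallest value of S(t) - M(t) over the grid t = 1/(steps+1), ..., steps/(steps+1)."""
    return min(
        S(k / (steps + 1), r) - M(k / (steps + 1), r) for k in range(1, steps + 1)
    )


for r in range(2, 11):
    print(r, min_gap(r))
```

The gap shrinks to zero at both endpoints $t = 0$ and $t = 1$, so the grid deliberately stays away from them; a positive minimum on the grid is consistent with the strict inequality but does not replace the concavity argument.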



7.2 A disproof of Sidak-like positive correlation conjecture (66) 



By the direct inspection, we get that 



i v (n(l-i) , x i 

sd{k) = ( i - — — ) ( (i - -y-\i - -^—) n ^ 



n — lj V r n — 1 



Which gives the limit 

$$\lim_{n \to \infty,\ \frac{m}{n} \to t \in [0,1]} \frac{\log(SD(K))}{n} = (r - 1) \log\left(1 - \frac{t}{r}\right) - 2(1 - t) =: L(t).$$

As $\log(1 - \frac{t}{r}) < 0$ and $r - 1 < r - t$ for $0 < t < 1$, it follows that 

$$L(t) > M(t), \quad 0 < t < 1.$$
8 Credits and Conclusion 



Definition (2.1) apparently has a rich and important stat-physics meaning centered 
around the so called Bethe Approximation. The Bethe Approximation is also one of the main 
Heuristics in the modern practice of Machine Learning, especially in inference on graphical 
models (it is quite rare for a Heuristic from Machine Learning to have such amazing 
proof power). 

Although this stat-physics background was not used in the current paper, it and its 
developers (to be named in the final version) deserve a lot of praise: don't forget that many 
very good mathematicians have completely overlooked the seemingly simple Theorem 2.2. 
It would be fantastic to have a rigorous and readable proof of Theorem 2.2 based on 
new(age) methods. The author is a bit skeptical at this point: any such proof would 
essentially reprove Schrijver's very hard permanental bound. The other avenue is to better 
understand and possibly to simplify Schrijver's original proof; perhaps it has some 
deep stat-physics meaning. 

It is possible that one can use higher order approximations (the Bethe Approximation 
being of order two, as it involves marginals of subsets of cardinality two). Luckily, this order 
two case is covered by Schrijver's lower bound (JSJ). The higher order cases will probably 
need new lower bounds (involving subpermanents?). It looks like the beginning of a 
beautiful (and hard) new line of research. 

Our proof of Friedland's monomer-dimer entropy conjecture illustrates the power of 
Theorem 2.2. Interestingly, monomer-dimer entropy is a classical topic in stat-physics. 
The author is not a physicist, passionately so, even after 11 years at Los Alamos. 
Yet, there is a certain justice in the coincidence that some roots of this paper can be 
traced back to Hans Bethe... what a great group of creative people worked in New Mexico 
back then! 